An Explicit Mapping for Kernel Data Analysis and Application to Text Analysis
Authors
Abstract
Kernel data analysis is becoming standard in applications of data analysis and data mining. Kernels are used to represent a mapping into a high-dimensional feature space, where the explicit form of the mapping is unknown. Contrary to this common understanding, we introduce an explicit mapping that we consider standard. We use this mapping for three reasons. (1) Using this mapping does not lose any fundamental information in kernel data analysis, and we obtain the same formulas in every kernel method. (2) Derivations usually become simpler with this mapping. (3) New applications of kernel methods become possible with this mapping. As an application, we consider a text mining example in which we use fuzzy c-means clustering with cluster centers in the high-dimensional space and visualize the centers using kernel principal component analysis.
Keywords: kernel data analysis, fuzzy clustering, explicit mapping, text mining
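For context on what it means for the feature mapping to stay implicit, the following is a minimal sketch of the usual kernel-trick computation for a fuzzy c-means style cluster center. It is not the explicit mapping proposed in the paper, which the abstract does not spell out. It assumes the center is the weighted mean of the mapped points with weights proportional to u^m, so the squared distance to the center can be written with Gram matrix entries only. The function names, the RBF kernel, and the parameter values are all illustrative assumptions.

```python
import math

def rbf_kernel(x, y, gamma=0.5):
    """Gaussian (RBF) kernel; gamma is an illustrative choice, not from the paper."""
    sq = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq)

def gram_matrix(points, kernel):
    """Gram matrix K[i][j] = kernel(points[i], points[j])."""
    return [[kernel(p, q) for q in points] for p in points]

def kernel_sq_distances(K, memberships, m=2.0):
    """Squared feature-space distance from every point to one fuzzy cluster center.

    Assumed center: v = sum_j w_j * Phi(x_j), with w_j = u_j^m / sum_l u_l^m.
    Then ||Phi(x_k) - v||^2 = K_kk - 2 * sum_j w_j K_kj + sum_j sum_l w_j w_l K_jl,
    so Phi itself is never evaluated.
    """
    powered = [u ** m for u in memberships]
    total = sum(powered)
    w = [p / total for p in powered]
    n = len(K)
    # The last term is the same for all points, so compute it once.
    center_norm = sum(w[j] * w[l] * K[j][l] for j in range(n) for l in range(n))
    return [
        K[k][k] - 2.0 * sum(w[j] * K[k][j] for j in range(n)) + center_norm
        for k in range(n)
    ]

if __name__ == "__main__":
    pts = [(0.0, 0.0), (0.2, 0.1), (3.0, 3.0)]
    K = gram_matrix(pts, rbf_kernel)
    # Hypothetical memberships of the three points in a single cluster.
    print(kernel_sq_distances(K, [0.9, 0.8, 0.1]))
```

With a linear kernel the same function reproduces ordinary squared Euclidean distances to the weighted mean, which is a convenient sanity check for the formula.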
Similar resources
A prediction distribution of atmospheric pollutants using support vector machines, discriminant analysis and mapping tools (Case study: Tunisia)
Monitoring and controlling air quality parameters form an important subject of atmospheric and environmental research today due to the health impacts caused by the different pollutants present in the urban areas. The support vector machine (SVM), as a supervised learning analysis method, is considered an effective statistical tool for the prediction and analysis of air quality. The work present...
Psychometric Analysis of Hypertension Self-Management Behaviors Questionnaire; an Application of Intervention Mapping Approach in Questionnaire Development
Aims: High blood pressure is a common and major preventable risk factor for many diseases. This study aimed to assess the psychometric properties of the cognitive determinants of hypertension self-management questionnaire among Iranian hypertensive patients, based on the Intervention Mapping approach. Instrument & Methods: This psychometric study was conducted in Abadan in 2019. Content Validity Rati...
Non-Euclidean independent component analysis and Oja's learning
In the present contribution we tackle the problem of nonlinear independent component analysis by non-Euclidean Hebbian-like learning. Independent component analysis (ICA) and blind source separation were originally introduced as tools for the linear unmixing of signals to detect the underlying sources. Hebbian methods became very popular and successful in this context. Many nonlinear ICA e...
Biosynthesis of Ag Nanoparticles at Ziziphus Jujuba Kernel Substrate using Tilia platyphyllos Extract: Catalytic Activity for Reduction of Organic Dyes
For the first time, the extract of the plant Tilia platyphyllos was used for the green synthesis of Ag nanoparticles (NPs) supported on Ziziphus jujuba kernel as an environmentally benign support. Ag NPs/Ziziphus jujuba kernel as an effective catalyst was prepared through reduction of Ag+ ions using Tilia platyphyllos extract as the reducing and capping agent and Ag NPs immobilization...